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Abstract 

An approach to induction is presented, based on the idea of analysing the 
context of a given problem into 'circumstances' . This approach, fully Bayesian 
in form and meaning, provides a complement or in some cases an alternative to 
that based on de Finetti's representation theorem and on the notion of infinite 
exchangeability. In particular, it gives an alternative interpretation of those for- 
mulae that apparently involve 'unknown probabilities' or 'propensities'. Vari- 
ous advantages and applications of the presented approach are discussed, espe- 
cially in comparison to that based on exchangeability. Generalisations are also 
discussed. 

PACS numbers: 02.50.Cw,02.50.Tt,01.70.+w 
MSC numbers: 03B48,60G09,60A05 



Note, to head off a common misconception, that this is 
in no way to introduce a "probability of a probability". 
It is simply convenient to index our hypotheses by 
parameters [. . . ] chosen to be numerically equal to the 
probabilities assigned by those hypotheses; this avoids 
a doubling of our notation. We could easily restate 
everything so that the misconception could not arise; it 
would only be rather clumsy notationally and tedious 
verbally. 

E. T. Jaynes, Monkeys, kangaroos, and N |Jl|], p. 12 



1 Dramatis personae, notatio, atque philosophia 

We continue here our exploration of the notion of 'circumstance' and of its applica- 
tions in plausibility theory. Through this notion we shall here approach no less than 
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the question of induction, i.e., of the prediction of unobserved evens from knowl- 
edge of observed (similar) ones. The study can be read independently of the previous 
one [§], although the two elucidate each other. 

The following characters appear in this study, the first two having a walk-on part 
only: 

The 'propensitor': a scholar who conceives of a 'physical disposition' of some 
phenomena to occur with definite relative frequencies, and call this physical charac- 
teristic 'propensity'. This scholar sometimes makes inferences about propensities by 
means of probability as degree of belief. 

The 'frequentist ': a scholar who conceives probability as 'limit relative fre- 
quency' of a series of 'trials' (or something like that). Frequentists are often propen- 
sitors at heart. 

The 'de Finettian': a scholar who conceives probability as 'degree of belief, 
pedantically insisting on the subjective nature of all plausibility assignments, and 
through the notion of infinite exchangeability makes sense of the frequentist's and 
propensitor's voodooistic practises. The views and tools of this scholar [e.g., |j[ ^ ^ 
||, 0, ||, ^, 1C, 11 ] are supposed known to the reader. 

The 'plausibilist' or 'probability logician', who in this study will be your humble 
narrator: a scholar who conceives of probability as a formalisation and schematisa- 
tion of the everyday notions of 'plausibility' and 'probability' — just as truth in for- 
mal logic is such a formalisation and schematisation of the everyday notion of 'truth' 
— and does not dwell more than so in its meaning. The views of this scholar are a dis- 
tillate of the philosophies and views and/or the works of Laplace [12], Johnson [13], 
Jeffreys [M pL 0, Cox [fil, ph, Jaynes All K)l EBj, Tribus [g], de Finetti [§, | 



Adams @ 0], Hailperin g5fTand others [e.g7|ir© W\ STl! g fl], |2 



34, 35, B6, 37, 38, BJ, 40]. 



Plausibilists agree with de Finettians on basically all points. The only difference 
is in emphasis: although they recognise the 'subjective' character of initial plausibil- 
ity assignments — i.e., that these are matter of convention — , they also think that 
it is not such a big deal. It should in any case be no concern of plausibility theory, 
which is 'impersonal in the same sense as in ordinary logic: that different people 
starting from the same [assignments] would get the same answers if they followed 
the rules' (Jeffreys [16], reinterpreted). This will surely be self-evident when plausi- 
bility theory will reach its maturity, just as it is self-evident in formal logic today. In 
fact, the same subjective, conventional character is present — no more, no less — in 
formal logic as regards the initial truth assignments, those which are usually called 
'axioms' or 'postulates'. Albeit in formal logic there is perhaps less place for dis- 
agreement amongst people about initial truth assignments, the possible choice being 
there between 'true' and 'false', or '0' and '1' only — and not amongst a continuum 
of possibilities [0, 1] as in plausibility theory. 

The above remark is not meant to diminish the historical importance of de Fi- 
netti's Nietzschean work in plausibility theory. Emphasis on subjectivity may still be 
necessary sometimes. In this study we shall also use terms like 'judgement' to this 
purpose. 
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The following notation will be used: 

P(A|fi) denotes the plausibility of the proposition A conditional on B or, as we 
shall also say, given B or in the context B. The proposition B is supposed to express 
all knowledge (including beliefs), data, and 'working hypotheses' on the grounds of 
which we make the plausibility assignment for A. We are intransigent as regards the 
necessity and appropriateness of always specifying the context of a plausibility (as 
well as that of a truth!), 1 a point also stressed by Jeffreys [14, 15], Cox [17, 18], 
Jaynes ||, |TJ], Tribus Q, de Finetti @ § 4], Hajek and apparently 
Keynes [I2 - 



A plausibility density for propositions A x concerning (in a limit sense) a contin- 
uous parameter x is denoted by p(A x \ B), and is as usual implicitly defined by 



-L 



p(A x \ B) dx for all appropriate E; 



(1) 



in general, x h-> p(A x | B) is a generalised function, here always intended in the sense 
of Egorov ^ §|, ||| (see also g || |55U56|]). 2 

The remaining notation follows ISO ^8] and ANSI/IEEE ^ standards. 



2 General setting of the question 

The question that we are going to touch is, in very general terms, that of induction: In 
a given situation, given a collection of observations of particular events, assign the 
plausibility for another collection of unobserved events. Note that temporal distinc- 
tions are not relevant (we did not say 'past' or 'future' events) and we shall not make 
any. Of course, in stating this general question we have in mind 'similar' events. 3 So 
let us call these events 'instances of the same phenomenon', following de Finetti's 
appropriate terminology [^, |], 48]. We also suppose to know that each such instance 



can 'manifest itself in a constant (and known) number of mutually exclusive and 
exhaustive 'forms', and there is a clear similarity between the forms of each instance 
(that is one of the reasons we call the events 'similar'). 

All this can be restated and made more concrete using a terminology that is nearer 
to physics; but we must keep in mind that the setting has not therefore become less 
general. We call the phenomenon a 'measurement', 4 and its instances 'measurement 
instances'. The forms will be called '(measurement) outcomes'. We can finally 
state our question thus: In a given situation, given the observed outcomes of some 
instances of a particular measurement, assign the plausibility for the unobserved 
outcomes of some other instances of that measurement. 



'The context may also have a clarifying role in formal logic. Cf. e.g. the studies by Adams [ p3[ |24| , 
0|, Lewis [0 0], Hailperin [§|, 0, Barwise [00], and Gaifman [0]. 

2 Alternatively, i nteg rals as the above can be intended as generalised Riemann integrals [0, 0, 0] 
(see also [0, |f 0])- 

3 But 'similar' in which sense? The answer to this question cannot be given by plausibility theory, 
but can be formalised within it, as shown later. 

4 Even more appropriate, but too long, would be 'measurement scheme'. 
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We represent the situation, measurements, etc. by propositions. The situation 
by /, the measurement instances by M (r \ the outcomes of the Tth instance by R. 
(different instances of the same outcome are those with different r but identical i). 
Instances are thus generally denoted by an index (t) with r - 1, 2, . . . ; its range may 
be infinite or finite, a detail that will be always specified as it will be very important in 
later discussions. Other propositions will be introduced and defined later. With this 
representation, our question above simply becomes the assignment of the plausibility 

P(R {rN+L) A • • • A R (rN+{) \ M (Tn+l) A • • • A M (Tn+i) A R (rN) A • • • A R {r,) A /) (2) 

for all possible distinct r a , distinct i a , N ^ 0, and L > 0. 

A more convenient notation. If you are wondering what those M (T) are doing in 
the context of the plausibility, consider that the plausibility of observing a particu- 
lar outcome at the r instance of a measurement is, in general, the plausibility of the 
outcome given that the measurement is made, times the plausibility that the mea- 
surement is made (if the latter is not made the outcome has nought plausibility by 
definition): 

P(R\ I) = P(R\ MM) P{M\ /) + P{R\ -iM A /) P(-.M| I), 
= P(R\ MM) P(M\ I). 

But of course when we ask for the plausibility of an outcome we implicitly mean: 
given that the corresponding measurement is or will be made. We assume that knowl- 
edge of an outcome implies knowledge that the corresponding measurement has been 
made — symbolically, P(M^ T) |/?^ T ' A I) — 1 if t' = t" — hence there is no need 
of specify in the context the measurements of those outcomes that are already in the 
context. But the necessity remains of explicitly writing in the context the measure- 
ments of the outcomes outside the context. This can sometimes be notationally very 
cumbersome, and therefore we introduce the symbol Wl with the following conven- 
tion: in the context of a plausibility, always stands for the conjunction of all 
measurement instances M (T) corresponding to the outcomes on the left of the condi- 
tional symbol '['. Thus, e.g., 

P(R ( P A Rf } \ Wl A Rf ] A I) = P(4 ?) A fif } | M (1) A M (3) A Rf A /). 

Note that 9JT has not a constant value (it is in a sense a metavariable); we indicate this 
by the use of a different typeface (Euler Fraktur). 

With the new notational convention the plausibility (||) can be rewritten as 

P(R (rN+L) A • • • A R (rN+,) \ m A R (rN) A • • • A R (ri) A /). (EL 

3 The approach through infinite exchangeability 

The propensitor (and, roughly in the same way, the frequentist as well) approaches 
the question of assigning a value to (§) by supposing that the outcome instances ('tri- 
als') are 'i.i.d.': that they are independently 'produced' with constant but 'unknown' 
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propensities. As N in @ becomes large, the relative frequencies of the outcomes 
must tend to the numerical values of the propensities. These frequencies can then be 
used as 'estimates' of the propensities, and so the estimated propensity for outcome 
Rj at an additional measurement instance can be given. This summary surely appears 
laconic to readers that have superficial knowledge of this practise. But this does not 
matter: it is the approach of the de Finettian that interests us. 5 Recall that for the 
de Finettian, and for the Bayesian in general, 'probability' is not a physical concept 
like 'pressure', but a logical (and subjective) one like 'truth'. To mark this difference 
in concept we are using the term plausibility, which has a more logical and subjective 
sound. 

How would a de Finettian approach the problem above? More or less as follows: 

[Afictive de Finettian speaking:] 'Before I know of any measurement outcomes, 
I imagine all possible infinite collections of instances of the measurement M. Let 
us suppose that I judge two collections that have the same frequencies of outcomes 
to have also the same probability. The probability distribution that I assign to the 
infinite collections of outcomes is therefore symmetric with respect to exchanges 
of collections having the same frequencies. Such a distribution is called infinitely 
exchangeable. De Finetti's representation theorem [Q, ||, |6|, [7|, || [| 1C, [Tl], [71]] (see 
also Johnson and Zabell [13, [72], [73]]) says that any L-outcome marginal of such 
distribution, where the outcomes {/?,} appear with relative frequencies L = (Li), can 
be uniquely written in the following form: 

P(r£ l) A • • • A R\^\ m A 7) = f(n <ft) r(q\ I) dq, (3) 

Rl appeal's L\ times, etc. 

where q = (qj) are just parameters — not probabilities! — satisfying the same posi- 
tivity and normalisation conditions (qt ^ 0, = 1) as a probability distribution, 
and q i-> r(q\ I) is a positive and normalised generalised function, which can be 
called the generating function of the representation [cf. fT^\ . Example: I can write 
the probability of a collection of three measurement instances with two outcomes R5 
and one R% as 

P(4 Tl) A Rf 2) A R[ T3) \ TlAl) = Jq 5 2 q & T(q\ I) dq. 

In particular, the probability for the outcome i in the whatever measurement instance 
t can be written as 

P(/?| T) | M (T) Ai) = J q, r(q\ 7) dq. (4) 

A propensitor or a frequentist would say that the right-hand side of eq. (||) represents 
the fact that the outcome instances are "independent" (therefore "their probabilities 



5 There are authors who apparently keep a foot in both camps; see e.g. Lindley and Phillips' arti- 
cle[0] 
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qi are simply multiplied to give the probability of their conjunction"), "identically 
distributed" (therefore "the probability distributions q is the same for all instances"), 
and moreover "their probability is unknown" (therefore "the expectation integral over 
all possible probability distributions q"). But de Finetti's theorem shows that all these 
mathematical features are simply consequences of infinite exchangeability, and we 
do not need any "i.i.d." terminology. 

'The generating function q i-» r(q\I) is, by de Finetti's theorem, equal to the 
limit 

, _ , 'All possible collections of L outcomes with i„ ,\ 

r% I) dq = hm P ( frequ P endes . n ^ mnge ^ L(gi + I m A /) , (5) 

i.e., r(q\ I) is equal to the probability — assigned by me — that my imagined infinite 
collection of outcomes has limiting relative frequencies equal to (qi). My exchange- 
able probability assignment determines thus f uniquely. But de Finetti's theorem 
also says the converse, viz., any positive and normalisable V uniquely determines 
an infinitely exchangeable probability distribution. This allows me to specify my 
probability assignment by giving the function T instead of the more cumbersome 
probability distribution for infinite collections of outcomes. 

'Let us suppose that I am now given a collection of N measurement outcomes, 
and in particular their absolute frequencies ,/V = (Ni). Given this evidence, the proba- 
bility for a collection of further L unobserved measurement outcomes with frequen- 
cies (Li) is provided by the basic rules of probability theory: 

P(R (rN+L) A • • • A R {rN+[) \ 2H A R ( : Tn) A • • • A R (r,) A /) - 

1N+L 'N+l IN '1 

S, appeal's L, times Rj appeal's W, times 

P(R (rN+L) A • • • A R {rN+l) A R {rN) A • • • A R {ri) \ TIM) 

v 'N+L 1JM 1_N n ' 

P(R (rN) A ••• AR (ri) \Wl A I) 

Using de Finetti's representation theorem again, this probability can also be written 
as 

P(R ( : rN+L> A • • • A R {rN+[) \ Ti A R (Tn) A • • • A R (r,) A I) - 

IN+L 'N+l IN H 



J (n q-') m\ R% N) A • • • A Rf ] A I) dq. (7) 



The function q \-> r(q\Rj A ■ ■ ■ A Rj^ 1 ' A I) is different from the previous one 
q h> r(q\I), but the two can be shown [e.g., [FT]] to be related by the remarkable 
formula 

(n<zf) r( q \D 

F(q\ R^ A ••• A R^ A I) = , (8) 

f{llqf')r(q\I)dq 

i 

which is formally identical with Bayes' theorem if we define 
f(Rf } A R ( ^ A ■ ■ ■ A R^ L) \ q, I) := q h q h ■ ■ ■ q k for all L, i a , and distinct r a , (9) 
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which includes in particular 

r{R ( p\q,I):= qi for all T. (10) 

(Note that these are only formal definitions and not probability judgements, since the 
various Ts are not probability distributions.) Thus I can not only specify my infinitely 
exchangeable distribution by f , but also update it by 'updating' f. Moreover, for N 
enough large this function has the limit 

r(q\R\l N) A---AR^ Ar)^b(§-q) as N -> oo. (11) 

* ^ 

R\ appears Nj times 

Comparing with eq. @, this means that 

p(^+l)| M (r N+t ) A R (r N ) A . . . A R (r0 A ;) „ J_ &s N ^ ^ ( with T ± T J j ( 12 ) 
Ri appears Nj times 

i.e., the probability I assign to an unobserved outcome gets very near to its observed 
relative frequency, as observations accumulate. This also means that two persons 
having different but compatible initial beliefs (i.e., different initial exchangeable dis- 
tributions having the same support) and sharing the same data, tend to converge to 
similar probability assignments.' 



The point of view summarised above by the de Finettian is quite powerful. It 
allows the de Finettian to make sense, without the need of bringing along ugly or 
meaningless metaphysical concepts, of those mathematical expressions very often 
used by propensitors (and frequentists as well) that are formally identical to formu- 
lae (Q) ('expected propensity' or 'estimation of unknown probability'), (|5|) ('proba- 
bility as limit frequency'), @ (definition of 'propensity'), (|l^) ('probability equal to 
past frequency'). This point of view and the notion of exchangeability can moreover 
be generalised to more complex situations g §, |, [10|, 0, 0, [75|, [7|][cf. also [77|], 
leading to other powerful mathematical expressions and techniques. 



4 Why a complementary approach? 

We shall presently present and discuss another approach, not based on exchangeabil- 
ity or representation theorems, that can be used to give an answer to the question 
of induction, and to make sense of the propensitors' and frequentists' formulae and 
practise as well. Why did we seek an approach different from that based on infinite 
exchangeability? Here are some reasons: 

(a) One sometimes feels 'uncertain', so to speak, about one's plausibility as- 
signment. One can also say that some plausibility assignments feel sometimes more 



'stable' than others. This is lively exemplified by Jaynes [21, ch. 18]: 



7 



Porta Mana, Mansson, Bjork 



The Laplace-Jaynes approach to induction 



Suppose you have a penny and you are allowed to examine it carefully, 
convince yourself that it's an honest coin; i.e. accurately round, with head 
and tail, and a center of gravity where it ought to be. Then, you're asked to 
assign a probability that this coin will come up heads on the first toss. I'm 
sure you'll say 1 /2. Now, suppose you are asked to assign a probability to 
the proposition that there was once life on Mars. Well, I don't know what 
your opinion is there, but on the basis of all the things that I have read 
on the subject, I would again say about 1/2 for the probability. But, even 
though I have assigned the same "external" probabilities to them, I have a 
very different "internal" state of knowledge about those propositions. 

To see this, imagine the effect of getting new information. Suppose 
we tossed the coin five times and it comes up tails every time. You ask 
me what's my probability for heads on the next throw; I'll still say 1/2. 
But if you tell me one more fact about Mars, I'm ready to change my 
probability assignment completely. There is something which makes my 
state of belief very stable in the case of the penny, but very unstable in the 
case of Mars. 

An example similar to the Martian one is that of a trickster's coin, which always 
comes up heads or always tails (either because the coin is two-headed or two-tailed, 
or because of the tosser's skills). Not knowing which way the tosses are biased, you 
assign 1 /2 that the coin will come up heads on the first observed toss. But as soon 
as you see the outcome, your plausibility assignment for the next toss will collapse 6 
to 1 or 0. The 'uncertainty' in the plausibility assignments given to the penny and 
to the trickster coin's toss could be qualitatively pictured as in Fig. 0. We think 
that this 'uncertainty' in the plausibility assignment — or better, as Jaynes calls it, 
this 'difference in the internal state of knowledge' with regard to the propositions 
involved, is a familiar and undeniable feeling. There is in fact a rich literature which 
tries to take it into account by exploring or even proposing alternative plausibility 



theories based on probability intervals or probabilities of probabilities (see e.g. [{78 
79[ |8(], H H 0» es P- § 3.1; H, |4|l and cf. [§5], § 2.2]). 



(b) A plausibility assignment, according to de Finetti's approach, begins with 
a judgement of exchangeability. One is not concerned within the formalism itself 
with the motivation of such a judgement (de Finetti recognises that there usually 
is a motivation, but it is relegated to the informal meta-theoretical considerations). 
Yet such judgements are often grounded, especially in natural philosophy, on very 
important 7 reasonings and motivations which one would like to analyse by means of 
plausibility theory. 8 



6 Just like a quantum-mechanical wave-function. 
7 And, trivially, subjective. 

8 In a way there is a point in Good's statement | p5j ch. 3] 'it seems to me that one would not 
accept the [exchangeability assumption] unless one already had the notion of physical probability and 
(approximate) statistical independence at the back of one's mind', although there is no need to bring 
'physical probabilities' and 'statistical independence' along. 
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Figure 1: Qualitative illustration of the 'uncertainty' in the plausibility assignments 
for 'heads' in the case of a fair coin-toss (blue dashed curve, with maximum at 1/2) 
and the toss of a trickster's coin (orange continuous curve, with maxima at and 1). 



The illustration is made quantitative in § 6.2 



(c) When the maximum number of possible observations (measurements) is fi- 
nite and small it does not make sense to make a judgement of infinite exchangeability, 
not even as an approximation (cf. [p^Ipi § 4.7. 1]). De Finettians can in this case use 



finite exchangeability [Q, 10 1, but then they cannot make sense of the propensitor's 



formulae like (|4j) or @, which the propensitor, however, is still entitled to use even 
in this case. 

(d) Finally, another important reason is that the generalisation of de Finetti's the- 



orem studied by Caves, Fuchs, and Schack [|8g, |87|, |88|, |89j fails for some physical sta 



tistical models, viz. quantum mechanics on real and quaternionic Hilbert space. We 
consider this failure not as a sign that complex quantum theory is somehow 'blessed' , 
but rather as a sign that either the theorem can be generalised in some other way that 
holds in any physical statistical model, or de Finetti's approach can be substituted or 
complemented by another one whose generalisation applies to any physical statisti- 
cal model whatever. The point here is that de Finetti's theorem — just like the whole 
of plausibility theory — belongs to the realm of logic, not physics, and thus should 
apply to any conceivable physical theory that be logically consistent, even one that 
does not describe actual phenomena. 

The alternative point of view to be presented meets all the points raised above: 
(a) It allows us to formalise within plausibility theory — i.e., without resorting to 
interval-valued plausibilities and the like — the intuitive notion of 'uncertainty' of a 
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plausibility assignment, and even to quantify it. (b) It stems directly from an analysis 
of the conditions in which judgements of exchangeability usually originate and apply, 
(c) It can be used to interpret the propensitor's practise when the maximum number 
of possible observations is finite and small, (d) It readily generalises to any logically 
consistent physical statistical model, whether it apply to actual phenomena or not; 
in particular, within quantum theory it easily applies to such problems as quantum- 
state assignment, quantum-state 'teleportation', and others that involve the notion of 
'unknown quantum state'. 

This point of view, which we shall call the 'Laplace-Jaynes approach', stems 
from a re-reading and possibly a re-interpretation of Jaynes [[j], esp. pp. 11, 12, 15] 
(cf. also [0, lect. 18] [[20], lect. 5][|2| ch. 18]), Laplace @, and de Finetti g § 20] 
(cf. also Caves [^llj) and is strictly related to the 'approach through circumstances' 
presented in the previous paper [gp. Mosleh and Bier also propose and discuss 
basically the same point of view. 

It cannot be too strongly emphasised that this point of view is fully in line with 
plausibility theory, Bayesian theory, and de Finetti's point of view: Some plausibili- 
ties with particular properties will be introduced, based on particular judgements; but 
the acceptance or not of the latter is, as with judgements of exchangeability, always 
up to the individuals and their knowledge. 

We now present this 'Laplace-Jaynes approach' and show how it can be used 
for the question of induction in a manner parallel to the approach through infinite 
exchangeability. We shall then discuss how it meets points 0, H above (not neces- 
sarily in that order). The discussion of the fourth point is left to the next study of this 
series (in preparation). 

5 The Laplace-Jaynes approach 
5.1 Introducing a set of circumstances 

With the notation and the general settings of the introduction, let us recapitulate 
how we reason and proceed in the case of exchangeability. The situation / leads 
us to assign an infinitely exchangeable plausibility distribution to the collection of 
measurement outcomes. In other words, / is such that we consider two plausibilities 
like e.g. P(Rf ) A R^ 4} \ Wl A /) and P(Rf A R^\ 9JT A /) as equal. 

In § |], point (g), we remarked that the approach through exchangeability is not 
formally concerned about those details of the situation which lead us to see a sort 
of analogy ar similarity amongst different measurement and outcome instances and 
thence make a judgement of exchangeability. Rather, the whole point is that this 
similarity is expressed by, or reflected in, the exchangeable plausibility assignment. 
As de Finetti says, 'Our reasoning will only bring in the events, that is to say, the 
trials, each taken individually; the analogy of the events does not enter into the chain 
of reasoning in its own right but only to the degree and in the sense that it can influ- 
ence in some way the judgment of an individual on the probabilities in question' [Q 
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p. 120]. And also: 

What are sometimes called repetitions or trials 9 of the same event are for 
us distinct events. They have, in general, some common characteristics 
or symmetries which make it natural to attribute to them equal proba- 
bilities, but we do not admit any a priori reason which prevents us in 
principle from attributing to each of these trials [. . . ] some different and 
absolutely arbitrary probabilities [...]. In principle there is no difference 
for us between this case and the case of n events which are not analo- 
gous to each other; the analogy which suggests the name "trials of the 
same event" (we would say "of the same phenomenon") is not at all es- 
sential, but, at the most, valuable because of the influence it can exert on 
our psychological judgment in the sense of making us attribute equal or 
very nearly equal probabilities to the different events.' [Q, p. 113, foot- 
note 1] 

But although we agree on this terminology and its motivation, we do not agree 
on the unqualified and indiscriminate diminution of the importance of the similarity 
amongst measurement instances. Such similarity, especially in the natural sciences, 
often stems from or is traced back to similarities at deeper 10 levels of analysis and 
observation. It is by this process that natural philosophy proceeds. 

So let us suppose that the considerations that lead us to a probability assignment 
for various measurement instances can be analysed — and formalised — at a deeper 
level. 1 1 More precisely, we suppose to have identified for each measurement instance 
t a set of diverse possible 'circumstances' {C^ T) : j - I,...} — all these sets having 
the same cardinality — with the following properties: 

I. The first involves the plausibilities we assign to the circumstances: 

P(C ( p\ D Al) = P(C ( p\ I) for all j and all D representing conjunc- 
tions of measurements (but not of out- 
comes!) from the same or other instances, 

p(c ( ; ) Adp\i) = o if/*/', (I) 
p(vc; t) i/) = zp(c; t) i/) = i, 

i.e., we judge the {C^p : j = 1, . . . } to be mutually exclusive and exhaustive, 
or in other words we are certain that one of them holds, but we do not know 
which. Moreover, we judge any knowledge of measurement instances (but not 

9 And what we call here instances (Authors' Note). 
10 We do not mean 'microscopic'. 

"Using, once more, words by de Finetti: 'we enlarge the analysis to include our state of mind in 
relation to other events, from which it might or might not be independent and by which, consequently, 
it will or will not be modified if they occur, or if we learn of their occurrence' ||93|, § 28]. 
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of outcome instances) as irrelevant for assessing the plausibilities of these cir- 
cumstances, or in other words, simply knowing that a measurement is made 
does not give us clues as to the exact circumstance in which it is made. 12 

II. The second involves the plausibility distribution we assign to the outcomes, 
conditional on knowledge of a circumstance: 

P(/?h M (T) ADA d T) A /) - P(R ( . r) \ M (T) A C { ? A /) 

for all t, i, j, and all D representing a conjunction of measurements, 
measurement outcomes, and circumstances of instances different 
from t, (II) 

which means that if we were certain about any circumstance dp for the in- 
stance r we should judge knowledge of any data concerning other instances as 
irrelevant for the assessment of the plausibility of the outcome of instance t. 
(The expression above is undefined if Cy happens to be inconsistent with D, 

i.e. if P(D\ Tl A dp A I) = 0; but this will not cause problems in the following 
analysis.) 

III. The third property concerns the relationships amongst our plausibility assign- 
ments for the outcomes in different instances: 

P(R (T ' ] \ M ir,) A dp A I) = P(/? (T " } | M (T " } A d r " ] A I) =: qu for all t', t", 

1 J 1 J J 

(III) 

which expresses the fact that we see a similarity between outcomes and circum- 
stances of different instances, like when we say 'the same outcome', 'the same 
measurement', or 'the same circumstance'. It is thanks to this property that 
we can often drop the instance index '(t)' and make sense of expressions like 
'P(7?;| M A C j A /)', which can stand generically for 

P(Ri\ M A Cj A I) := P{R {r) \ M {r) A d r) A I) for any t, 

7 ' (13) 

= q u . 

In the following, expressions like 'P(/?,| M A Cj A /)' will be understood in the 
above sense. 

IV. The fourth property strengthen the similarity amongst instances, and is the one 
that makes induction possible: 

P(dp\ dp A I) = 8 ,v » for all t', t". (IV) 

This means that we believe that if a particular circumstance holds in a particular 
instance, then the 'same' circumstance (in the sense of (£nj) and (^V|)) holds in 

12 This assumption can be relaxed. 
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all other instances. This property implies (and, with the help of (0), is implied 
by) the following: 



for all t and j, P(C\ L '\ I) = P(A C '\ 1) =: y, (14) 

a J 

for some yf, and 

(n\ \Yi if 7cr = 7 for all a, 
P(AC (a) \I) = \ n JlT J (15) 
a ]a 10 otherwise, 

i.e., the plausibility that we assign to the Tth instance of the jth circumstance 
is equal to that assigned to the collection consisting exclusively of 'repetitions' 
of the jth circumstance. Collections consisting of non-corresponding circum- 
stances have nought plausibility. This allows us to define 

P(Cj\I) := P(AaC ( p\I), 

= P(Cj T) |7) for any t, (16) 

s rj> 

and 

Cj:=ACf\ (17) 

Thanks to the definitions above the expression 'P(C ; |/)' can be used to unam- 
biguously denote the plausibility that 'the "same" circumstance 'Cf holds in 
all measurement instances'. 

Note that in all the properties above the number of instances ( i.e., the range ofr) can 
be either infinite or finite. 

Before further commenting the above properties, let us try to partially answer 
the question: what are these 'circumstances'? The most general and precise answer 
is: they are whatever (propositions concerning) facts you like that make you assign 



plausibilities satisfying properties (Q)-(|V|). No more than this would really need be 



said. But we can emphatically add that the circumstances need not concern 'mecha- 
nisms', 'causes', 'microscopic conditions', or the like; and that they do not 13 concern 
'unknown probabilities'. 14 They concern details of the context that are unknown, but 
that (we judge) would have a great weight in our plausibility assignment for the {R^} 
if we only knew them; so great a weight as to render (practically) unimportant any 
knowledge of the details of other measurement instances. As for the choice of such 



13 Cannot, in a logical sense; cf. Remark 2 in 

14 But the circumstances may concern 'propensities', if the latter are intended as sorts of (ugly) 
physical concepts. This possibility is due to the generality of plausibility theory which, like classical 
logic, does not forbid you to bring along and reason about unreal or even preposterous concepts and 
entities, like 'nagas' [g^J], 'valier' 'propensities', and 'wave-particles', provided you do it in a 
self-consistent way. 
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details, we have complete freedom. A trivial example: the tosses of a coin which we 
have not examined, but which we know for sure to be two-headed or two-tailed. In 
this case the propositions 'Coin is two-headed (at toss t)' and 'Coin is two-tailed (at 
toss t)' form, for each t, a set of circumstances that satisfy property (|). But we also 
know that the coin is the same at all tosses, which implies property (PV|); hence we 
can omit the specification '(at toss r)' without confusion. If we knew that the coin 
was two-headed (at toss t) we should assign unit plausibility to heads for all future 
or past tosses, i.e., 

P('Heads at toss t'| 9Jt A 'Coin is two-headed (at toss t)' A /) = 1 for all T, 
and we should consider knowledge of the outcomes of other tosses as irrelevant. 15 An 



analogous discussion holds for the two-tails possibility. Thus properties (II) and (HI) 
are also satisfied. 

This was an extreme example, for the plausibilities conditional on the circum- 
stances were nought or one. But it needs not be so: other circumstances could lead 
to less extreme conditional plausibility judgements. In general, remaining within the 
coin toss example, we could ask: By which method is the coin tossed? What are its 
physical characteristics (two-headedness, centre-of-mass position, elastic and rigid 
properties, etc.)? Upon what is it tossed? Who tosses it? 16 Can there be any symme- 
tries in my state of knowledge in respect of the situation? What are the consequences 
of the toss or of the outcomes? — and a set of circumstances could be distilled from 
the possible answers to these and other questions. Note in particular, with regard 
to the last two questions, that a circumstance may be a judgement of symmetry or 
may also be a consequence, in some sense, of the outcomes. Such kinds of circum- 
stances are perfectly fine as long as they lead you to make a plausibility assignment 



on the outcomes and satisfy the properties (g)-(|yp. This emphasises again the fact 
that a circumstance needs not be a sort of 'mechanism' or 'cause' of the measure- 
ments or of the outcomes. Note also that properties (|) and determine neither 
the plausibilities P(C^ T) | I) nor the P(R\ r) \ M (t) A C { p A /). These plausibilities are, a 
de Finettian would say, 'fully subjective'. 



By properties (II) and fllllD we can interpret and give a meaning to the locution 'in- 
dependent and identically distributed events'. Such locution means only that we are 
entertaining some circumstances that, according to our judgement, render the con- 
ditional plausibilities of corresponding outcomes of different measurement instances 
equal, so that we need not specify the particular instance ('identically distributed'); 
and render knowledge of outcomes of other instances irrelevant, so that we can spec- 
ify the conditional plausibility for an outcome independently of the knowledge of 
other outcomes ('independent'). The locutions 'unknown probability' and 'probabil- 



ity of a probability' will also be interpreted in a moment (§ 5.3). 



15 Because we can but expect all tosses to give heads. Note that if a toss has given or will give tails, 
this signals a contradiction in our knowledge. I.e., some of the data we have (about the coin or about 
toss outcomes) have to be mendacious. But this is a problem that does not concern plausibility theory. 
Like logic, it can give sensible answers only if the premises we put in are not inconsistent. 

16 See Jaynes' insightful and entertaining discussion [ pl| , ch. 10] on these kinds of factors. 
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5.2 Approaching the problem of induction through a set of 
circumstances 

How do we face the question of induction with these 'circumstances' and the as- 
sumptions that accompany them? We answer this question in two steps. 

Having introduced sets of circumstances [C^p], we must assign (subjectively, a 
de Finettian would say) for all i and j the plausibilities 

P(Ri\ M A Cj A I) = qij (:=P(^ T) |M (T) AC ( p A 7) for any t), (18) 
P(C;| /) = Tj (:= P(A 1 = p «?f I D for any t), (19) 

in the sense of eqs. (13) and (|l6|). It is then a simple consequence of the rules of plau- 
sibility theory, together with the properties and definitions of the previous sections, 
that the plausibility we assign to any collection of measurement outcomes is given 
by 

P(R {TL) A ■ ■ ■ A R (r,) \Wl A I) = 

V ' 

R\ appears L\ times, etc. 

P(Ri\ MACjA I) L ] P(Cj\ I) = J](n qfj P(Cj\ I). (20) 

j ' J 1 

Note how this plausibility assignment depends only on the frequencies (L,) of the 
outcomes: it is an (infinitely) exchangeable assignment — although exchangeability 
was not our starting assumption. 

Let us suppose that we are now given a collection of N measurement outcomes, 
and in particular their absolute frequencies ,/V = (TV,-). Given this evidence, the plausi- 
bility for a collection of outcomes, with frequencies (L,), of further L measurements 
is also derived by the basic rules of plausibility theory, from our initial plausibility 
assignments (18) and (|l9|), using expression (|0|) and the properties of the circum- 
stances: 

P(R (Tn+l) a • ■ • A R (rN+[) \ Tl A R (rN) A ■ ■ ■ A R (ri) A I) — 

in+l in ^ '1 

S, appeal's L, times Rj appeal's Nj times 

JJU P(Ri\ MACjA I) L ] P(Cj\ R^ A • • • A A 7) = 

YjYl^r(Cj\R^A...AR^Al), (21) 



J 



with 



(Uq^PiCjH) 

Efn^) new 



P(Cj\ Bf™ A • • • A R™ A I) = —- . (22) 
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These formulae present many similarities to the de Finettian's (0) and (§), and 
to the formally similar expressions used by 'frequentists' and 'propensitors'. But it 
should be noted that, in the present formulae, := P(/?,| 9JT A Cj A /), P(C/| /), and 
P(C 7 | Ry N ' A • • • ARy Al) are actual plausibilities, not just parameters or positive and 
normalised generating functions. ('And they are fully subjective!', the de Finettian 
reiterates). 



5.3 Plausibility-indexing the set of circumstances 

The marked similarities can in fact be made into identities of form. To achieve this, 
we go back to the point where, afti 
made the plausibility assignments 



we go back to the point where, after having introduced the circumstances {C - }, we 



P{Ri\ M A C j A I) = qij (:= P(^ T) | M (T) A C ( p A I) for any t), (|l8]) r 

P(Cj\ I) = Yj (:= P(A a cf >| I) = P(Cf| /) for any r), © r 

for all i, j (and t). Instead of proceeding as we did, we now follow the remark by 
Jaynes [JT], p. 12], cited in the introductory epigraph: 

It is simply convenient to index our hypotheses by parameters [q] cho- 
sen to be numerically equal to the probabilities assigned by those hy- 
potheses; this avoids a doubling of our notation. We could easily restate 
everything so that the misconception could not arise; it would only be 
rather clumsy notationally and tedious verbally. 

What Jaynes calls 'hypotheses' we have here interpreted, more generically, as 'cir- 
cumstances'. Let us now group together those that lead to the same plausibility 
distribution for the outcomes. That is, fix a t, and form the equivalence classes of 
the equivalence relation 

C*? ~ C ( .? <=> for all i, P(R (r) \ M (T) A A /) - P(R {r) \ M (T) A d$ A I). (23) 

J J ' J ' J 

By the very method these classes are defined, each one can be uniquely identified by 
a particular set of values (g,) = q of the plausibility distribution for the outcomes. 



Note that owing to property (gDj) the value of q does not depend on t. Call therefore 
q~ the class identified by a particular q, and denote membership of C^ T) by 'j e q~' 
for short. Now let us take the disjunction of the circumstances in each class q~ and 
uniquely denote this disjunction by S ? : 

4 T) : = V Cf. (24) 

It is easy to see that, owing again to property (|), each yields a conditional 
distribution for the outcomes that is numerically equal to q, i.e. numerically equal to 
all those yielded by the C^ T) in q~ : 

P(/^ T) | M (T) A S f ] A I) = q t for any r. (25) 
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Moreover, the {S ^ } also satisfy properties (@)-(fv|) as the circumstances |C^ T) }, and 
their plausibilities are easily obtained from those of the latter: 

P(4 T) |/)= I P(C;. T) |7), P(S- q \I)= S P(Cj\I), (26) 



where S q := Ac^s m analogy with the definition (17) 



These S ^ are thus a sort of 'plausibility-indexed' circumstances. We shall call 

them also 'coarse-grained circumstances' sometimes. The C^ T) can then be called 
'fine-grained' circumstances when a distinction is necessary. 

In general, we have a class q~ and a disjunction S^ T) for particular (vector) values 
of q only (depending on the initial choice of circumstances and on the initial assign- 
ments (|l^)). But we can formally introduce propositions S~ , all equal to the false 
proposition (A A -A), for the remaining values q' . These propositions hence have 
nought plausibilities, P(S^ } \I) = 0. The plausibilities P(/?| T) | M (T) A A I) are 
undefined, but they will appear multiplied by the former in all relevant formulae, and 
hence their product will vanish by convention. With this expedient we can consider 
the whole, continuous set [S ^ \ q e A} for all possible distributions q which natu- 
rally belong to the simplex A := {{q,) \ qi ^ 0, 2/<7i = !}• We can thus substitute 
an integration J A . . . p(S q \ . . . ) dq for the summation Y> q ■ ■ ■ P(S§I ■ • ■ )■ The density 
p(S q \ . . . ) will be a generalised function. 17 

We can therefore analyse the context I through the plausibility-indexed circum- 
stances {S q r) }, instead of the {C^ T) (. The rationale behind this is that those circum- 
stances Cy T) that belong to a given class q~ have all the same effect on our judgement 
as regards the assignment and the update of the plausibilities of the outcomes; more- 
over, the ratios of their updated plausibilities P(C^| ... A 7) cannot change upon 
acquisition of new measurement outcomes and are equal to those of their prior plau- 



sibilities, see piOq , [107J and ||2[ remark 3 and § 4.2]. It is therefore not unreasonable 
to handle equivalent C^ T) class-wise. The special notation chosen for the different 

classes, 'S^', the index 'g' in particular, is to remind us on the grounds of what 
plausibility judgements (the q) we grouped the original circumstances thus in the 
first place. But it is only a notation, nothing more. Each S^ T) is a disjunction of 
propositions like, e.g., 'The coin is two-headed, or the tosses are made by a trickster 
with a predilection for heads, or ... ' — it is not a statement about the plausibilities 
qi (nor about 'propensities'). 

The effect of this 'bookkeeping' notation is, however, surprising for the form 



our induction formulae ([20|)-([22|) take when expressed in terms of the plausibility- 



indexed circumstances {S^^}- The plausibility we assign to any collection of mea- 



17 We can avoid the expedient above if we li ke; t hen the integral s that follow m ust b e und erstood 
in a measure-theoretic sense |§, ||, [m], [fin], [l02] |To3|] (cf. also [|l|, [l04|, [lo|]), with 



p(Sg\ . . - )dq' standing for appropriate singular measures. 
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surement outcomes takes now, by eq. (|25|), the form 



P(R {L) A ••• AR (l) \m A I) = 

R[ appears L\ times, etc. 

J [nP(/?,-|MAS 5 A/) L '] p(S- g \I)dq = J (Uqf') p(S q \I)dq, (27) 

with value numerically equal to that of (2^), and with the same remarks and conven- 
tions about the index r as made in § |5T . The expression above is formally identical 
to the de Finettian's (0) with the correspondence p(Sq\I) r(q\I). In particular, 
the plausibility for the outcome of any instance takes the form 

P(/?| T) | M (t) A /) = J qi V {S q \ I) dq, (28) 

formally identical to @. 

The plausibility conditional on the observation of N outcomes takes the form 

P ( R (tn + l) a . . . a r (Tm)\ m A R (r N ) A _ _ _ A gfa) A 7) = 

'N+L iJV+1 IN >l 

•~ - s ^ ' 

Ri appears L, times i?, appears JV,- times 

J [n P(Ri\ M A S g A 7) L <] V (S q \ Rf N N) A • • • A Rf ] Al)dq = 

J (I! 9?) pO^I*^ A • • • A 4 T1) A Z)d? (29) 



with 



(n#)p(5 ? i/)d^ 

i 



p(,S 9 | < Tw) A • • • A /?< Tl) A 7) dq = ——- — — , (30) 



again formally identical to the de Finettian's ^fy and <JsJ> _ 

Apart from the congruence between their mathematical forms, the above formu- 
lae and those derived from exchangeability have very different meanings. Whereas in 
the exchangeability approach the q = {q{) were just parameters, in the Laplace-Jaynes 
approach they are (numerical values of) actual plausibilities, viz. the P(/?| T) | A 
Cj A I) or the P(/?j T) |M (T) A A I). Whereas in the exchangeability approach 
q i-> r(q\ . . . ) was only a generalised function, in the Laplace-Jaynes approach 
q i — > p(S q\ . . .) is (the density of) an actual plausibility distribution — a distribu- 
tion, however, not over 'probabilities' or 'propensities', but rather over propositions 
like 'The coin is two-headed, or the tosses are made by a trickster with a predilection 
for heads, or . . . '. 

Equations (29) and (30) (as well as ( j2"Tl ) and (22)) show that the observation of 



measurement results has a double 'updating' effect in the circumstances approach. 
Not only are the plausibilities of unobserved results updated, but those of the cir- 
cumstances as well; in fact, the former updating happens through the latter. This is 
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related to what Caves calls 'learning through a parameter' [^IJ], although there are no 
parameters here, only plausibilities. Also in the exchangeability approach, one could 
argue, is the generating function r(q\I) updated; but its updating only represents 
and reflects a change in our uncertainty or degree of belief about the outcomes, no 
more than that. In the Laplace-Jaynes approach, the updating of p(S q\I) represents 
instead a further change in our uncertainty about events or phenomena other than the 
outcomes. We shall return to this in a moment. 

We finally also see how the Laplace-Jaynes approach interprets and makes sense 
of the notions of 'unknown probability (or propensity)' and 'probability of a proba- 
bility': what is unknown is not a probability, but which circumstance from a set of 
empirical ones holds; the second probability is therefore not about a probability, but 
about an empirical circumstance. 

5.4 Limit for large number of observations 

Let us suppose that the plausibility distribution p(Sq\I)dq does not vanish for any 
q. This implies that the set of 'fine-grained' circumstances [Cj] is a continuum; with 
analytical and topological care this case should not present particular difficulties. 



That assumption being made, from eq. ( p0| ) we have that as the number N of 
observations increases, the updated distribution for the plausibility-indexed circum- 
stances asymptotically becomes 

p(5 q \Tl A A • • • A A /) a 5(| - q) as N -» oo, (31) 
- 

R, appears JV,- times 

which is formally identical to the de Finettian's (|Tl|). It also follows that 

P(R {rN+[) \ M (TN+l) A R ( : Tn) A • • • A R {ri) A I) * — as Af — > oo, (32) 

v i ' in ^ n N 

Rj appears Nj times 

exactly as in (ff2[). Also in the Laplace-Jaynes approach, then, the plausibilities as- 
signed to unobserved outcomes get nearer their observed relative frequencies as the 
number of observations gets larger (under the assumptions specified above). And 
persons sharing the same data and having compatible initial plausibility assignments 
tend to converge to similar plausibility assignments, as regards their respective sets 
of plausibility-indexed circumstances (and as regards unobserved measurement out- 



comes). Cf. the discussion in §§ 6.2 and 5.4, and see also Jaynes [21, ch. 18] (see 



also []19[ lect. 18] g, lect. 5]). 

What happens if the distribution p(Sq\I)dq vanishes at, or in a neighbourhood 
of, the point (qi) - (Ni/N), as may be the case when the number of 'fine-grained' 
circumstances {Cj} is finite? It happens that the updated distribution gets concen- 
trated at those S § (and those Cj) with q (respectively {qu)) nearest (Ni/N) and for 
which the initial plausibility does not vanish. One should make the adjective 'near- 
est' topologically more precise. There are also interesting results at variance with the 
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asymptotic expression ( |l2| ) when (Ni/N) cannot be obtained as a convex combination 
of those q (respectively (qu)) whose related plausibilities do not vanish. All this is 
left to another study. 

6 Discussion 

6.1 Relations between the circumstance and exchangeability 
approaches 

The Laplace-Jaynes approach and the infinite-exchangeability one do not exclude 
each other and may be used simultaneously in many problems. In fact, if in a 
given problem we can introduce 'circumstances', and the number of measurement 
instances is potentially infinite, the resulting distributions for collections of outcomes 
are then infinitely exchangeable and all the results and representations based on ex- 
changeability also apply, beside those based on the circumstance representation. This 
leads to the following powerful proposition: 

Proposition. In the Laplace-Jaynes approach, if the number of measurement in- 
stances is potentially infinite then 

p{r {Tl) a • • • a R (r[) \ art a d a i) = 

* ^ ' 

R\ appears L\ times, etc. 

J (11 qf) r® D A I) dq = f (n qf') pOS 9 \ DAI) dq, (33) 

and in particular 

r(q\DAl)dq = p(Sg\D Al)Aq, (34) 

for any L, {i a }, and any D representing a (possibly empty) conjunction of measure- 
ment outcomes not containing the instances Tj , . . . , X]_. 

The equalities of the above Proposition may be read in two senses, whose mean- 
ing is the following: If in a given situation we have introduced circumstances {Sq\ (or, 
equivalently, {C,}) and assigned or calculated their plausibility density p(S q\D A I), 
then we also know, automatically, the generating function F(q\D A I) of de Finetti's 
representation theorem, which we should have introduced had we taken an exchange- 
ability approach. Vice versa, if we approach the problem through exchangeability 
and assign an infinitely exchangeable distribution to the possible infinite collections 
of outcomes, we have automatically assigned also the plausibility density p(5^| DA I) 
for whatever set of plausibility-indexed circumstances {S q ] we might be willing to 
introduce. The density of the Laplace-Jaynes approach and the generating function 
of the exchangeability one are, in both cases, numerically equal. 

What is remarkable in the second direction of the Proposition is that the plau- 
sibilities of the plausibility-indexed circumstances are completely determined from 
our exchangeable plausibility assignment, even when we have not yet specified what 
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those propositions are about! Although remarkable, this fact is a consequence of 
the particular way the plausibility-indexed circumstances are formed from the 'fine- 
grained' ones. Note, moreover, that the plausibilities of the 'fine-grained' circum- 
stances {Cj} would not be completely determined in general, even though their values 
would be constrained by eq. (26). 

Some philosophical importance has the fact that p(5^| . . . ), numerically equal to 
r(q\ .. .), is an 'actual' plausibility distribution, whereas the latter is only a gener- 
ating function. It is practically impossible to specify most infinitely exchangeable 
plausibility distributions directly, and this is one of the main points and advantages 
of de Finetti's theorem: a de Finettian can specify an infinitely exchangeable distri- 
bution by making a (often necessary) detour and specifying V instead. It is, however, 
a bit disconcerting the fact that one is almost always forced to specify the 'as if in- 
stead of what according to de Finetti is 'the actual plausibility'. The Laplace-Jaynes 
approach gives instead a direct meaning to the generating function r as a plausibility 
distribution, so that one needs not feel embarrassed to specify it. 

Can all situations which can be approached through infinite exchangeability also 
be approached from the Laplace-Jaynes point of view? Asking this means asking 
whether all situations for which we deem exchangeability to apply can be analysed 
into circumstances having the properties (|)-(IV). There is surely much place for 
discussion on the answer to this question, if by 'circumstances' we really mean, in 
some sense, 'interesting circumstances'. One could also wonder whether on a formal 
level the answer could be 'yes'. The reason is that the Laplace-Jaynes approach 
would seem to formally include the infinite-exchangeability one. To see how, make 
first a judgement of infinite exchangeability, and then introduce and define a set {Cy} 
of propositions stating that the limiting relative frequencies of an infinite collection 
of outcomes have certain values /. This should be possible since the existence of 
this limit under exchangeability is guaranteed by de Finetti's theorem. Intuitively, 
such a set {Cj} =: {CP} for any r would seem to constitute (under a judgement of 
infinite exchangeability) a set of circumstances that satisfy properties (|)-(IV). If 
this were true then, whenever infinite exchangeability applied, the Laplace-Jaynes 
approach would be at least formally viable. But intuition is not a reliable guide here 
(the definition of a proposition like Cj, e.g., would apparently involve a disjunction 
over the set of permutations of natural numbers), and the above reasoning is not 
mathematically rigorous. We have not the mathematical knowledge necessary to 
rigorously analyse and answer this sort of conjecture, and since we find it in any case 
uninteresting, let us not discuss it any further. It can be added, however, that any 
criticism from de Finettians with regards to the infinities involved in the argument 
would be like sawing the branch upon which they are sitting themselves, since we do 
not see how one can effectively make a judgement of infinite exchangeability without 
first considering possibly problematic propositions like {Cj}. 
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6.2 'Unsure' and 'unstable' plausibiliy assignments 

The presence of circumstances in the analysis of the problem, and the consequent 



fact that the plausibility of the outcomes can be decomposed as in eq. (28), suggest 
an interpretation of those feelings of 'uncertainty' and 'stability' about a plausibility 
assignment mentioned in § ||, point (||). A person might feel 'sure' about an assign- 
ment 1/2 to heads in the situation I s because of the following, perhaps unconscious, 
reasoning: 'If I knew that the coin was two-headed or the toss method favoured 
heads, I'd assign 1 to heads. If on the contrary I knew that the coin was two-tailed 
or the toss method favoured tails, I'd assign to heads. If I knew that the coin was 
a usual one and the tosser knew nothing about tossing methods, I'd assign 1/2 to 
heads. But the coin, although I gave to it a rapid glance only, seems to me quite com- 
mon and symmetric; and I know that the tosser, a friend of mine, knows no tossing 
tricks. So I can safely exclude the first two possibilities, and I'm practically sure of 
the third. Yes, I'll give 1/2 to heads'. In this reasoning, the person entertains (for 
each t) a set of possibilities, like 'The coin is two-headed', 'The toss method favours 
tails', etc. From these a set of three plausibility-indexed circumstances S q, with 
q - (<7heads,<?taiis), 18 is distilled, corresponding to the q values (1,0), (1/2, 1/2), and 
(0, 1). To these circumstances the person assigns, given the background knowledge 
7 S , the plausibilities 

P(S (1>0 )|/ s )*0, 

PCS (1/2,1/2)1 A) « 1, (35) 
P(S (0> i)|/ s )*0. 

Then, according to the rules of plausibility theory, the total plausibility assigned to 
heads is 

P('heads'|/ S ) = £P( 'heads'^ A 7 S ) P(^|/ s ), 

q 

= lxP(5 ( i,o)|/s) + l/2xP(5 (1/2il/2) |/ s ) + OxP(5 (0 ,i)|/ 8 ), ( 36 > 
-1x0 + 1/2x1+0x0 = 1/2. 

The fact to notice here is that amongst all the circumstances, those that lead to a con- 
ditional plausibility for heads near to the total plausibility, viz. 1/2, have together far 
higher plausibility than the others. This fact can be interpreted as the source of the 
feelings of sureness and stability associated to that final plausibility assignment: the 
person has conceived a number of hypotheses, and the total plausibility assigned on 
the grounds of these is practically equal to the plausibilities assigned conditionally 
on the most plausible hypotheses only. Put it otherwise, the most plausible circum- 
stances are not discordant, all point more or less to the same plausibility assignment. 
Mathematically this is reflected in the fact that the distribution P(>S^|/ S ) is concen- 
trated around the median. 



18 Note, once more, that the S g concern not plausibilities but statements like, in this case, 'The coin 
is a usual one and the tosser knows nothing about tossing methods', etc. 



22 



Porta Mana, Mansson, Bjork 



The Laplace-Jaynes approach to induction 



The reasoning of a person who feels 'unsure' about an assignment 1/2 to heads 
in the situation 4 might go instead as follows: 'Had I known that the coin was a 
common one, I'd have assigned 1 /2 to heads. But I have heard from reliable sources 
that this coin is not a common one, so I can safely exclude that possibility. If I 
knew that the coin was two-headed, I'd assign 1 to heads. If I knew that the it was 
two-tailed, I'd assign to heads. But I'm completely unsure about these two possibil- 
ities ! I have to give 1 /2 to heads then' . Also in this reasoning the person introduces a 
collection of hypotheses and from these distills a set of three plausibility-indexed cir- 
cumstances similar to those introduced by the 'sure' person. It is in the assessment of 
the plausibilities P(Sq\ I) that the two persons — owing to their different background 
knowledges I s , 4 — differ. In this case the assessment yields 

P(S ( i >0 )|/u)*l/2, 

P(S(l/2,l/2)|/u)*0, (37) 

P(5 ( o,i)|/u) * 1/2, 

and therefore the unsure person assign to the outcome 'heads' the total plausibility 

P('heads'|/ U ) = ZP('heads'|S$ A/ u ) P(5^|/ u ), 

i 

= 1 xP(5 (1,0)14) + 1/2 xP(5 (1/2,1/2)14) + 0xP(5 (0>1) |/ u ), ( 38 ) 
-1x1/2+1/2x0 + 0x1/2 = 1/2. 

The final assignment is the same as that of the sure person, but the most plausible 
circumstances for the unsure person would yield conditional plausibilities not near 
to the final value 1 /2. The unsure person would have given or 1 if a little bit more 
knowledge of the situation had been available. 19 In other words, the most plausible 
circumstances are discordant and point to different plausibility assignments, and this 
can be interpreted as the source of the feelings of unsureness and instability. In 
mathematical terms, the distribution P(S q\ 4) is not concentrated around the median. 

The two examples above are very simple, the number of possible plausibility- 
indexed circumstances being only three. With a deeper analysis this number could 



be very large, and the assignments ( |35| ) and ( p7| ) would be replaced by effective 
continuous distributions like e.g. 

p(S g\ I s ) dq oc (<7 heads ) 2 ° (<7tails) 2 ° 5(1 - <?heads - flails) dq, (39) 
p(S q \ 4) dq oc (q heads y 39/40 (q^sT 39 ' 40 5(1 - <7 heads - q tails ) dq, (40) 

whose graphs are those of Fig. [IJ blue dashed curve for the first and orange continu- 
ous curve for the second. These are Dirichlet (or beta) distributions, known for their 
particular properties, for which see e.g. jl], |iT, 72, |5[ 108] (cf. also [70]). 



The degree of 'sureness' or 'stability' about a plausibility assignment can thus be 
apparently captured, formalised, and even quantified by the plausibility distribution 



19 The assignment will in fact collapse onto one of those two values as soon as one toss is observed, 
leading to nought-or-one updated plausibilities for the circumstances S(o,i) and 
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of a set of circumstances, P(5^| /), and by its 'width' in particular, which can be de- 
fined and quantified in a different number of ways (entropy, standard deviation, etc.). 
Graphs such as those of Fig. || have then a legitimate and quantitative meaning within 
(single-valued) plausibility theory: they do not represent 'probabilities of probabil- 
ities'; they represent plausibility distributions amongst different unknown empirical 
circumstances, grouped for simplicity according to the conditional plausibilities they 
lead to. 

This interpretation is also consonant with the fact that, according to the results 
of § 5.4, eq. (pi]), as the observed data accumulate the distribution P(Sq\ /) gets more 
and more peaked around a particular value which gets also nearer the median, re- 
flecting the fact that accumulation of data tends to make us surer of our plausibility 
assignments. 

For further discussion and references, see Jaynes [21, ch. 18; cf. also 19, lect. 18; 



20, lect. 5] and 



6.3 Finite and small number of possible observations 

As mentioned in § |] with regard to point (£]), there are situations in which the infinite- 
exchangeability approach cannot be adopted: situations where we know that the num- 
ber of possible measurement instances is finite — and, in particular, small. In this 
case a de Finettian has three possible options as regards the application of exchange- 
ability. 

The first is to make a judgement of finite exchangeability and use the related 
representation theorem. It is known that the representation theorem for finite ex- 
changeability has a different form from that for infinite exchangeability, eq. (^]). The 
plausibility for a collection of outcomes takes in the finite case the form of a mixture 
of hypergeometric distributions ('urn samplings without replacement'). In the case 
of a finite but large number of possible observations this representation converges to 



the infinitely exchangeable one [Q, |10|] [see also [109J], so the de Finettian can make 
sense of the propensitor's 'unknown propensity' reasoning and of the related formu- 
lae, which are formally identical with (^)-(||), as approximations. But in the case of 
a small maximum number of possible observations the discrepancy between the two 
representations is too large, and if the 'propensitor' obstinately uses the 'unknown 
propensity' formulae, the de Finettian will then not be able to 'make sense' of them, 
as instead was the case with infinite exchangeability. 

The second option is to use de Finetti's theorem as generalised by Jaynes [O], i.e. 
extended to generating functions of any sign, but restricting the choice of the latter to 
positive ones only. But this restriction would lack meaningful motivation: What kind 
of plausibility judgement would the restriction reflect? — For we must remember 
that the specification of the generating function is for the de Finettian only a detour 
to specify the plausibility distribution of the outcomes, which is the only 'actual' one. 
The restriction to positive generating functions would only seem to reflect the third 
option, to which we now turn. 
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The third option is to enlarge the finite collection of measurement instances with 
an infinite number of Active ones, and then make a judgement of infinite exchange- 
ability for the enlarged collection. With this artifice the formulae of the propensitor 
can be recovered, and seemingly with a meaningful motivation. It seems to us, how- 
ever, that the introduction of very many (oo) Active measurement instances, apart 
from being unpleasant, is formally inconsistent. For the fact that one knows that the 
number of measurement instances can but be finite, say at most n, must be included 
in the context /; viz., A I is false for r > n. The plausibility of the outcomes 
of the Active 'additional' (t > n) measurements, which is conditional on /, is then 
by definition nought, or undefined. One cannot even say 'Let us suppose that ad- 
ditional measurements were possible', because such a proposition conjoined with / 
would yield a false context and the plausibilities conditional on it would then be un- 
defined. An alternative would be to eliminate from the context / those 'elements' that 
bound the number of measurement instances, so that the plausibilities of the Active 
additional measurements would neither be nought nor undeAned in the new, muti- 
lated context. But mutilation of prior knowledge — granted its feasibility — can be 
dangerous. 

The Laplace-Jaynes approach, on the other hand, applies unaltered to the case 
of a Anite, even small, maximum number of measurement instances. This is clear if 



we look again at properties (II)-(lV): they nowhere require the instance index r to 
run to inAnity, as cursively remarked directly after their enunciation. Hence, through 
the Laplace-Jaynes approach we can make straightforward sense of the propensitor's 
procedure also in these Anite situations. 

6.4 When is the Laplace-Jaynes approach useful? Making allowance 
for the grounds behind exchangeability assignments: some 
examples 



We have seen in § |Oj that the two approaches are not mutually exclusive but can 
coexist and strongly 'interact'. But in which sense are the two approaches comple- 
mentary? How ought we to decide which approach to choose? Why? There is no 



deAnite answer: as de Finetti warns [ 93 , § 20] : 



in certain enterprises it can be better to evaluate the probability of favour- 
able outcomes all in a block, to see at a glance whether the investment 
is secure or insecure, and in others it is better to reach this conclusion 
starting from an analysis of the individual factors that are in play. There 
is no conceptual difference between the two cases. Someone who wants 
to estimate the area of a rectangular held can with the same right esti- 
mate the area directly in hectares, or estimate the lengths of the sides 
and multiply them: reasons of convenience, practicality and custom will 
make one method preferable to the other, the one that seems to us more 
trustworthy in relation to our capacity to judge, or we can follow both 
methods, or try other ones, and consider all of them in Axing our opinion. 
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In assigning a plausibility to some measurement outcomes we may simply want, 
depending on 'convenience, practicality and custom', to assign plausibilities to all 
possible collections of similar outcomes (indirectly, by specifying the generating 
function of de Finetti's theorem), update the plausibilities according to observations 
already made, and then marginalise for the instances of interest, as described in § ||[ 
Or we may want to analyse the contexts of the measurements into more details and 
assign plausibilities to these, obtaining the final plausibilities for the outcomes of in- 
terest by the theorem on total probability, as described in § ||. Neither approach is 
more 'objective' than the other, for both must start from some set of (subjective, the 
de Finettian says) plausibilities that must be given without further analysis. 

In situations such as that of a 'common' coin toss or die throw, we (the authors) 
usually prefer in general the approach through exchangeability. Although such situ- 
ations could be analysed into more details and hypotheses about the way of tossing 
etc., we are not really interested in these and prefer to make an initial exchangeable 
plausibility assignment for future tosses, that we shall update according to observa- 
tions made — the number of which can be very large (i.e., infinite exchangeability 
can be applied, at least as an approximation). One could say that it is the plausibil- 
ities 'P(/?| T) | . . . )' that are of interest, and we collect measurement data to be put in 
place of the dots in order to update those plausibilities. But there are examples, espe- 
cially in the study of natural phenomena, where it is the details — the circumstances 
— that interest us most and that we want to investigate. In these cases measure- 
ment outcomes are collected not only for the sake of prediction of other, unknown 
ones, but also in order to modify our uncertainty about the circumstances, i.e., to 
change our initial plausibility assignments about them. In other words, the plausibil- 
ities 'P(C/| - - - )' interest us as much as the 'P(/?[ T) | ...)', and measurement data are 
collected to update both. 

A powerful feature of the approach through plausibility-indexed circumstances 
is that you can 'wait' to think about and to explicitly define the circumstances of 
interest until you have collected a large amount of observations. The 'procedure' is 
as follows: 

1. Imagine to have introduced numerous circumstances {Cj}, to have assigned the 
plausibilities P(/?;| M A Cj A /), and to have performed the 'coarse-graining' of 
the circumstances into the set {S q }. 

2. Assume to have assigned plausibilities P(C/| /) such that the plausibility den- 
sity p(S q \ I) over the plausibility-indexed circumstances is enough smooth and 
non-vanishing at every point. How much is 'enough' is related to the size of 
the collected data as for the next step. 

3. Collect a large amount of outcome observations D. What counts as 'large' 
is related to the smoothness assumed in the previous step for the plausibility 
density over the plausibility-indexed circumstances. 
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4. Update your distributions according to the observations D. If / = (ff) are the 
relative frequencies of the observed outcomes, the updated plausibility distri- 
bution over the plausibility-indexed circumstances would now be concentrated 
around the circumstance labelled by /, i.e., p(Sq\D A I) dq « 8(q - f)dq; 
cf. § 5A. This means that you would judge S f to be the most plausible cir- 



cumstance given the evidence D. (But remember: the introduction of circum- 
stances is up to now only imagined.) 

5. Now re-examine the context as it was before the observations, and really intro- 
duce a set of circumstances of interest, respecting the assumptions in steps [I] 
and ||[ In this process, however, you only need to concentrate on those circum- 
stances conditional on which the outcomes of any measurement instance are 
near to /, i.e. those for which P(/?,| M A C; AT) « fi. This restriction is sensible 
since, as for step f| above, these are the circumstances that have now become 
most plausible in the light of the observed data. 

This process reflects e.g. what we do when, having tossed a coin many times and 
seen that we nearly always obtain heads, we say 'there's something peculiar going 
on here, let's see what it can be', and begin to look for circumstances (like who 
tossed the coin, how the coin was tossed, on which surface it was tossed, how it was 
manufactured, etc.) which presumably lead to the observed peculiar behaviour. 

Examples in the same spirit, although more complex, abound in the natural sci- 
ences. Consider the following example from the study of granular materials [e.g., 
1 10| , 111]. These are made of a very large number of macroscopic (thermal agitation 



is unimportant) but small particles, that are hence considered point-like. These ma- 
terials can then be studied by the methods of statistical mechanics. Often of interest 
is the particle motion resulting from an externally applied force and the particles' 



collisions. In the study by Rouyer and Menon [112] (to which we refer for details) 
we are interested in the measurement — our M — of the horizontal velocity com- 
ponent v of any particle. The possible measurement outcomes {R v } are the possible 
different values of v. 20 What is the plausibility distribution that we assign to the 
different outcomes? Depending on the circumstances, we should make different as- 
signments. For example, if we knew that the particle collisions were elastic, the 
particle density enough low, long-range interactions negligible, and a couple more 
of details — all of which we represent by Cmb — » we should assign a Maxwell- 
Boltzmann, Gaussian distribution p(R v \M A Cmb A /)dv oc exp(v 2 /v 2 )dv, with an 
appropriate constant vo- (In the present case we know the collisions to be inelastic, 
so that Cmb is from the beginning known to be false; but let us disregard this to make 
the example more interesting.) If we knew [ |112| ] that the collisions were inelastic, 
knew that the density presented inhomogeneities, judged the particle density very 
relevant for the velocity distribution, and knew some other details — denote all them 
by Cp — , we should then assign a particular non-Gaussian plausibility distribution, 



20 Hence, the set of outcomes is continuous here but, with appropriate mathematical care, this is not 
so important. 
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according to the kinetic-theoretical considerations of Puglisi et al. Ql 13| ]. Finally, if 
we knew [ ]1 12| ] that the collision were inelastic, that energy was injected in the sys- 
tem homogeneously in space, and some other details — call them Cne — , we should 
then assign another particular non-Gaussian plausibility distribution, according to 



the study by van Noije and Ernst [ |114] ]. (In the present case we know that energy 
is injected from the boundary, so that Cne is known to be false; again, we disregard 
this.) 

We do not consider other possible circumstances for the moment, but go on to 
collect a large amount of observations instead — call these D. The distribution of 
frequencies thus observed goes like N v /Ndv - exp(v a /v*) dv with a « 3/2, a distri- 
bution very similar to that assigned on the grounds of Cp, and partially similar to that 
assigned conditional on Cne- By eq. ( |l2| ) we are thus led, in both the exchangeability 
and the Laplace-Jaynes approach, to assign 

p(/?f +1) |M (W+1) A D A 7)dv * exp(v a /vf )dv (41) 

and to say that 

r('exp(v a /vf)'|D A/) =p(S 'exp(v a /v?)' \D A I) has a very large value. (42) 

You note that T has 'exp(v a /v*)' as argument, which is also the index of 5. This 
is because v is a continuous variable, hence the vectors q = (qt) of our previous 
discussions become functions Q(v) here. The function 'exp(v a /v*)' thus represents 
a particular value of what is denoted by l q' in §§ ^] and ||. 

The analysis has up to now involved both the exchangeability and the Laplace- 
Jaynes approaches, and at this point the exchangeability approach basically termi- 
nates. But the interesting part of the Laplace-Jaynes approach, instead, begins. In 
fact, from eq. (|2|) we are led to assign a vanishing plausibility to the circumstance 
Cmb and a small one to Cne (which we already knew to be false, however) and a 
non-negligible plausibility to the circumstance Cp. This prompts other studies and 
experiments in order to update that plausibility so as to possibly decrease our uncer- 
tainty. Rouyer and Menon [ ]1 12| ] actually do this, and eventually come to a vanishing 
plausibility for the circumstance Cp. Other circumstances are still to be formulated 
and introduced. 

In the above typically physical example the 'circumstances' are indeed intended 
as 'causes' or 'mechanisms'. But as we remarked in § |5.1[ , this needs not always 
be the case. A very interesting and curious example, for which we believe the cir- 
cumstances cannot be interpreted as 'causes' or 'mechanisms', is provided by the 



study of the Newcomb-Benford law [115, 116, 117, 118], here tersely described by 



Raimi [117, p. 521]: 



It has been known for a long time that if an extensive collection 
of numerical data expressed in decimal form is classified according to 
first significant digit, without regard to position of decimal point, the 
nine resulting classes are not usually of equal size. Indeed, [. . . ] for 
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the occurrence of a given first digit i (i = 1,2,..., 9), 21 many observed 
tables give a frequency approximately equal to log 10 [(i + 1)//]. Thus the 
initial digit 1 appears about .301 of the time, 2 somewhat less and so on, 
with 9 occurring as a first digit less than 5 percent of the time. (We do 
not admit as a possible first digit.) 

This particular logarithmic distribution of first digits, while not uni- 
versal, is so common and yet so surprising at first glance that it has 
given rise to a varied literature, among the authors of which are mathe- 
maticians, statisticians, economists, engineers, physicists and amateurs. 

This example can be analysed along the lines of the previous one. The 'measurement' 
M is the observation of the first digits in a given collection of numerical data, the pos- 
sible 'outcomes' being R\,...,Rg. Having made a judgement of exchangeability as 
regards the outcomes of any number of observations, 22 after many observations our 
plausibility assignment will be very near to the observed frequencies, in this case 
approximately (log 10 [(/ + 1)//]). Here the approach through exchangeability stops. 



But the Laplace-Jaynes approach does not. According to eq. (31) the circumstances 
that have acquired highest plausibility are those which lead us to assign the 'sur- 
prising' distribution (log 10 [(z + 1)//]). The 'varied literature' which Raimi mentions 
consists almost exclusively in searches and studies of such circumstances. The sig- 
nificant point is that many proposed ones concern not 'causes' or 'mechanisms', but 
symmetries. 

A general and more thorough discussion of how the Laplace-Jaynes approach fits 
into the formalism of statistical physics will be given in our next study (in prepara- 
tion), together with an analysis of point ([]) of § |j. 

7 Generalisations 



In the Laplace-Jaynes approach introduced in § 5.1 a set of possible circumstances is 
defined for each measurement instance, and there is a mutual correspondence of these 
sets, mathematically expressed in particular by eqs. (Ill) and (|v|). These express the 



idea that there is a similarity amongst the measurement instances, for they can be 
analysed into the 'same' set of circumstances; and also the idea that the 'same', but 
unknown, circumstance holds in all instances. This approach can be generalised in 
different directions. 

A first generalisation is to introduce a set of circumstances {C^}, not for each 
measurement instance, but for all of them en bloc. I.e., each circumstance Cx con- 
cerns all measurement instances. This idea can be easily illustrated if different mea- 
surement instances are characterised by different times: then each C\ can represent a 



P 

22 Since the number of data is finite, infinite exchangeability can here be used only as an approxi- 
mation or an idealisation. 
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possible 'history' of the details of the instances. 23 Modifying properties and 
the subsequent sections by the formal substitution C. , and dropping prop- 

erty (IV), most part of the analysis and discussion presented in the previous sections 
holds unchanged, or with minor adaptations, for this generalisation. 

A second generalisation, with similarities with the preceding one, is made by 



dropping property (|V|) only. This means that we do not assume the 'same' circum- 
stance to hold at every measurement instance. All conjunctions Ac-Cj , where the 
j a can differ for different cr, can then have non-vanishing plausibilities (these con- 
junctions are obviously similar to the circumstances C\ above; cf. footnote El). With 



this generalisation formulae fl20|)-([22|) do not hold; in their place we have 
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The plausibility distribution for the collection of outcomes is therefore generally not 
exchangeable (the frequentist and the propensitor would say that the events are not 
'identically distributed', though independent). 

The plausibilistic framework of the last generalisation is used in non-equilibrium 



statistical mechanics Qll9| , |120[ [121], |122| , |123| ]. A generic circumstance C^ T) repre 



sents a system's being in a ('microscopic') state j at the time t r ; R. represents the 
obtainment of the ith outcome of a ('macroscopic') measurement M^ T) performed at 
time t r ; the plausibilities P(^ T) | M (T) A C (T) A /) are given by the relevant physical 



23 Note, however, that different measurement instances do not need to be associated to different 
times in general, so the 'history interpretation' is just an example. A more general way to think of the 
circumstances jC^) is to introduce for each measurement instance t a set of circumstances [C ), as 



5.1 



but without assuming any correspondence amongst the sets, not even the same cardinality. 
We then consider all possible conjunctions f\ T Cv \ and each such conjunction can be taken to be by 
definition one of the . 
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theory for all i, r, j. We initially assign, usually by means of maximum-entropy or 
maximum-calibre principles, an initial plausibility distribution P(/\ a C ■ \I) for all 

possible state dynamics /\ a C . that the system may follow. Then we update, from 
the history of the observed measurement outcomes, the plausibilities of the different 
dynamics and thence those of unobserved outcomes. 



Finally, a third generalisation, for which see QlOq , |107| ], is the introduction of 
multiple kinds of measurements instead of a single one M. The resulting representa- 
tion, which should be easy for the reader to derive, would correspond to de Finetti's 



theorem for partial exchangeability [11, 124]. 



8 Conclusions 

Beside exchangeability, there is another point of view from which the de Finettians 
and the plausibilists can approach the question of induction and thus also interpret 
and give meaning to the formulae of the propensitors and some of their locutions 
like 'unknown probability' or 'i.i.d.'. This point of view, which we have named 
the 'Laplace-Jaynes approach', is based on an analysis of the particular situation of 
interest into possible 'circumstances'. These 'circumstances' are propositions that 
concern details of an empirical nature; in particular, they are not and cannot be state- 
ments about probabilities. They do not necessarily involve notions of 'cause' or 
'mechanism', but can concern symmetries of the situation under study, or conse- 
quences of the observed events. More generally, their choice is fully 'subjective', as 
are the plausibilities assigned to them. Their main required property is that when they 
are known they render irrelevant any observational data for the purpose of assigning 
a plausibility to unobserved events. 

The Laplace-Jaynes approach can coexist with that based on exchangeability, 
and can be a complement or an alternative to the latter. It is particularly suited to 
problems in natural philosophy, whose heart is the analysis of natural phenomena 
into relevant circumstances (concerning especially, but not exclusively, the notion 
of 'cause'). It also allows — without resorting to alternative probability theories 
— an interpretation, formalisation, and quantification of feelings of 'uncertainty' 
or 'instability' about some plausibility assignments. Finally, this point of view is 
applicable in those situations in which the maximum number of possible observations 
is bounded (and, especially, small) and infinite exchangeability cannot therefore be 
applied. 

The Laplace-Jaynes approach is also straightforwardly generalised to the case 
of more generic sets of circumstances, the case of non-exchangeable plausibility as- 
signments (with applications in non-equilibrium statistical mechanics), and the case 
in which different kinds of measurements are present, usually approached by partial 
exchangeability. 
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